Breast cancer histopathological images classification based on deep semantic features and gray level co-occurrence matrix

Breast cancer is regarded as the leading killer of women today. The early diagnosis and treatment of breast cancer is the key to improving the survival rate of patients. A method of breast cancer histopathological images recognition based on deep semantic features and gray level co-occurrence matrix (GLCM) features is proposed in this paper. Taking the pre-trained DenseNet201 as the basic model, part of the convolutional layer features of the last dense block are extracted as the deep semantic features, which are then fused with the three-channel GLCM features, and the support vector machine (SVM) is used for classification. For the BreaKHis dataset, we explore the classification problems of magnification specific binary (MSB) classification and magnification independent binary (MIB) classification, and compared the performance with the seven baseline models of AlexNet, VGG16, ResNet50, GoogLeNet, DenseNet201, SqueezeNet and Inception-ResNet-V2. The experimental results show that the method proposed in this paper performs better than the pre-trained baseline models in MSB and MIB classification problems. The highest image-level recognition accuracy of 40×, 100×, 200×, 400× is 96.75%, 95.21%, 96.57%, and 93.15%, respectively. And the highest patient-level recognition accuracy of the four magnifications is 96.33%, 95.26%, 96.09%, and 92.99%, respectively. The image-level and patient-level recognition accuracy for MIB classification is 95.56% and 95.54%, respectively. In addition, the recognition accuracy of the method in this paper is comparable to some state-of-the-art methods.


Introduction
Cancer has always been a serious threat to human life and health [1]. In 2020, there were 19 most obvious changes in the latest cancer data in the world in 2020 is the rapid growth of new cases of breast cancer. It has replaced lung cancer as the world's largest cancer (2.26 million cases of breast cancer and 2.2 million cases of lung cancer) [2]. Early diagnosis and effective treatment of breast cancer is the key to improving the survival rate of patients. Compared with X-rays, mammography, magnetic resonance and other diagnostic methods, histopathological images can provide more important basis for cancer diagnosis and are considered as the gold standard for breast cancer diagnosis. The Whole Slide Image (WSI) of histopathological images usually with a large size, ranging from 40000 to 100000 pixels. Manual diagnosis of histopathological images is time-consuming, labor-intensive and error prone, which depends on the degree of concentration and fatigue of pathologists, it also requires a lot of prior knowledge and diagnostic experience. Misdiagnosis of breast cancer can cause very serious consequences, especially when a patient with malignant tumor is diagnosed as a benign, which may lead to missing the best time for diagnosis and treatment, and even makes the patients lose their lives. At present, the number of histopathological images produced every day is numerous, and the number of experienced pathologists is far from enough, which has seriously hindered the early diagnosis of breast cancer. In order to solve these problems, researches on Computer Aided Diagnosis (CAD) emerge in endlessly. CAD can not only improve the efficiency of diagnosis, but also reduce the workload of pathologists while providing more objective diagnosis results.
Some researchers employ handcrafted features for breast cancer histopathological images recognition. Spanhol et al. [3] proposed a publicly available breast cancer histopathology dataset called BreaKHis, six kinds of features including GLCM were used for the classification of this dataset, and the accuracy range from 80% to 85%. Belsare et al. [4] extracted features such as GLCM, graph run length matrix and Euler number for breast cancer histopathological images recognition. In our previous work [5], we explored the application of 9 feature descriptors such as GLCM in breast cancer histopathological image recognition. Anuranjeeta et al. [6] proposed a breast cancer recognition method based on morphological features. 16 morphological features were extracted, and 8 classifiers were used and the accuracy is about 80%. Sharma et al. [7] first segmented the nuclei region of the images and the parameter-free threshold adjacency statistics (PFTAS) features were extracted, then random forest (RF) was used for benign and malignant classification of breast cancer histopathological images. Carvalho et al. [8] used phylogenetic diversity indexes to characterize the types of breast cancer. Boumaraf et al. [9] fused Zernike moments features, Haralick features, and color histogram features for binary and eight classes classification of breast cancer histopathological images.
In recent years, the excellent performance of deep learning in images recognition has aroused the interest of a large number of researchers, especially CNNs. The evolution of these models prompted researchers to develop CNNs based CAD models and apply them to cancer diagnosis, such as breast cancer, lung cancer, prostate cancer, cervical cancer and liver cancer. In a broad sense, there are two kinds of CNNs, namely, the CNNs trained from scratch [10][11][12][13][14][15] and the pre-trained CNNs [16][17][18][19][20][21]. Spanhol et al. [10] trained a CNN with different image patches generation strategies from scratch based on a variant of AlexNet [22], and combined these patches for final classification of the BreaKHis dataset. Considering the cost of computation, the problem of convergence and the shortage of high quality labeled histopathological images, it is not the most practical strategy to train the model from scratch. In addition, the training of the model may be very time-consuming for a large number of data due to hardware constraints. To save training time, the authors of [10] also used pre-trained CNN to extract Deep Convolutional Activation Features (DeCAF), and then learned classifiers for new classification tasks [16]. The experimental results proved that the recognition accuracy of deep learning is significantly higher than their previous work [3]. Compared with the CNN trained from scratch, transfer learning from pre-trained models provides better initialization weights than random initialization, which speeds up the training of the models. Moreover, transfer learning makes it possible to build deep networks based on a small amount of data.
Many researchers also use CNN as a feature extractor to classify the extracted deep features using different machine learning methods. For example, Li et al. [23] used ResNet50 [24] to extract the features of image patches with different sizes, 3-norm was used for feature fusion, and finally used SVM for classification. Kausar et al. [25] extracted deep features by VGG16 [26] based on images obtained from Haar wavelet decomposition, and combined different features of the middle layers for breast cancer recognition. Saxena et al. [27] employed the pretrained ResNet50 and the kernelized weighted extreme learning machine to analyze the histopathological images and solved the class imbalance problem. Man et al. [28] proposed an unsupervised anomaly detection with generative adversarial networks (AnoGAN) to screen mislabeled patches and used pre-trained DenseNet121 [29] to extract multi-layered features of the discriminative patches for breast cancer classification. Saini et al. [30] first used a deep convolution generative adversarial network for data augmentation of benign samples, and then extracted the features of different pooling layers with a variant of pre-trained VGG16, and SVM was used for classification. Li et al. [31] proposed an interleaved DenseNet with SENet (IDSNet), which used the output of the three transition layers and the fourth dense block of DenseNet121 as the input of SENet. Combined with the classification sub-network, the extracted features were cascaded for the breast cancer benign and malignant classification. Wang et al. [32] proposed a parallel dual channels model which can extract convolution features (semantic information) and capsule features (spatial information) simultaneously, the fused features were used for breast cancer recognition. The highest accuracy achieved on BreaKHis dataset at 100× was 94.52%. Murtaza et al. [33,34] used the pre-trained AlexNet as the baseline model to extract deep features and analyzed the classification performance of six machine learning methods. Shallu and Rajesh [35] compared and analyzed the deep features extracted by pre-trained VGG16, VGG19, and ResNet50, as well as the color histogram, Hu invariant moments, and GLCM on the classification performance of breast cancer histopathological images. Experimental results show that using the pre-trained network as a feature extractor outperforms the baseline models and handcrafted features.
Part of the works explored the breast cancer histopathological images classification task with a magnification independent manner. Based on the BreaKHis dataset, Benhammou et al. [36] made a comprehensive review from four aspects: MSB, MIB, magnification specific multicategory (MSM) and magnification independent multi-category (MIM) classifications. Sharma et al. [37,38] trained a simple 6-layer CNN model without considering magnifications based on BreaKHis, while performed independently in the test phase to determine the ability of the model in classifying the data based on the magnifications. Yari et al. [39] constructed 6 deep models with different parameter settings based on ResNet50 and DenseNet161, the highest accuracy of the MIB classification was 99.26%. Liu et al. [40] introduced Bilinear Convolutional Neural Networks (BCNNs) and compared with several other deep learning methods. The accuracy of the MIB classification is 99.24%. Boumaraf et al. [41] proposed a pre-trained ResNet18 with global contrast normalization method to automated classify breast cancer histopathological images, including MSB, MSM, MIB and MIM classifications.
However, there are still several deficiencies. First of all, the dataset was divided into the training set and the test set according to the images while not the patients in some works. It did not consider that the images of the same patient cannot be used for training and testing at the same time, so as to obtain higher recognition accuracy. Secondly, in the research of using deep learning models as feature extractors, some of them only consider the features of the fully connected layers or pooling layers, and rarely consider the features of the convolutional layers, which leads to the loss of spatial information. In addition, the data augmentation had been applied in many works, which will increase the amount and time of calculation. Considering the above problems, in this paper, the dataset was randomly divided into 70%/30% (56 patients/26 patients) under the condition that patients used to build the training set were not used for the test set. Pre-trained DenseNet201 was used to extract the convolutional layer features for breast cancer histopathological images recognition without data augmentation. Our contributions are as follows: 1. A breast cancer histopathological images recognition model with fusion of deep semantic features and GLCM features is designed, which fully utilizes the complementarity of semantic features and texture features.
2. The three-channel (R, G, B) features for GLCM is considered, which are more discriminative than grayscale features.
3. The features of deep convolution layers are extracted as deep semantic features, which retain more spatial and structural information of the images.
4. By using the pre-trained models, the demand of the model for labeled samples is reduced, which avoids the complex process of image labeling.

Proposed method
DenseNet was proposed by Huang et al. [29]. They broke away from the stereotyped thinking of deepening the network layers (ResNet) and widening the network width (Inception) to improve the network performance, and constructed a new network structure from the perspective of features. The network structure is not complex, but very effective. DenseNet is a convolutional neural network with dense connections. In this network, the input of each layer is the union of the outputs of all previous layers, and the feature map learned by this layer will be directly transmitted to all subsequent layers as input. When CNN model is used as a feature extractor, texture features of images often come from shallow layers, including corner, edge, etc., which describe local changes of images. With the deepening of the network, the extracted features become more and more abstract, which are called semantic features and describe the global structure of images. However, the existing methods mostly emphasize the features of the fully connected layers or the pooling layers, and rarely consider the features of the convolutional layers. The nuclei of breast tumors are larger than that of normal tissues, and the nuclei are densely distributed. Compared with the features of the pooling layers and the fully connected layers, the features of the convolutional layers contain more spatial information and provide more information about the distribution of nuclei, which is of great significance for the accurate detection of breast cancer. GLCM is a common method to describe the texture of images by studying its spatial correlation characteristics. In 1973, Haralick et al. [42] proposed using GLCM to describe texture features. The excellent ability of GLCM in breast cancer histopathological images recognition, especially for the three-channel features of the images have been discovered in [5]. In this paper, three-channel features are considered. We calculate GLCM at 0, p 4 , p 2 , 3p 4 four directions with gray level of 256 and step of 1. Then, according to the GLCM, 22 features [42][43][44] were calculated, including autocorrelation, contrast, correlation in two forms, cluster prominence, cluster shade, dissimilarity, energy, entropy, homogeneity in two forms, maximum probability, sum of squares, sum average, sum variance, sum entropy, difference variance, difference entropy, normalized inverse difference, normalized inverse difference moment and information measures of correlation in two forms.
Given the GLCM of an image, p(i,j) is the (i,j)th entry in a normalized GLCM. p x (i) is the ith entry in the marginal-probability matrix obtained by summing the rows of p(i,j). N g is the number of distinct gray levels in the quantized image. μ is the mean value of the normalized GLCM. The mean value and standard deviation for the rows and columns of the matrix jÞ, respectively. The marginal-probability distribution represents as p x ðiÞ ¼ pði; jÞ, p y ðjÞ ¼ pði; jÞ, p xþy ðkÞ ¼ The equations of the 22 features are as follows: Corrm ¼ homogeneity Homom ¼ sum entropy Senth ¼ À difference entropy Denth ¼ À p xÀ y ðiÞlogðp xÀ y ðiÞÞ; ð18Þ information measures of correlation where HXY ¼ À X i X j pði; jÞlogðpði; jÞÞ, HX and HY is the entropy of p x and p y .
In this paper, a method of breast cancer histopathological images recognition based on deep semantic features and three-channel GLCM features is presented. The framework is shown in Fig 1. On the one hand, the original images are separated into R, G, and B channels, and the GLCM features of the three channels are extracted respectively. On the other hand, the original images are resized to 224×224, and then input to the pre-trained DensNet201 to extract the deep semantic features. Here, the output of the 1×1 convolutional layer in the 4th, 6th, 14th, 19th, 22nd, and 23rd blocks in the last dense block are extracted as the deep features. Concatenate the obtained three-channel GLCM features and deep semantic features, SVM is used to classify benign and malignant breast cancer.

Dataset
The BreaKHis dataset [3] contains biopsy images of benign and malignant breast tumors, which were collected through clinical studies from January 2014 to December 2014. During the period, all patients with clinical symptoms of breast cancer were invited to the Brazilian P&D laboratory to participate in the study. Samples were collected by surgical open biopsy (SOB) and stained with hematoxylin and eosin. These images can be used for histological studies and marked by pathologists in the P&D laboratory. The BreaKHis dataset consists of 7909 breast tumor tissue microscopic images of 82 patients, divided into benign and malignant tumors, including 2480 benign (24 patients) and 5429 malignant (58 patients). Each type is further divided into four subclasses. The type benign consist of adenosis (A), fibroadenoma (F), phyllodes tumor (PT) and tubular adenoma (TA) and the type malignant consist of ductal carcinoma (DC), lobular carcinoma (LC), mucinous carcinoma (MC) and papillary carcinoma (PC). The images are obtained in a threechannel RGB (red-green-blue) true color space with magnifications of 40×, 100×, 200×, 400×, and the size of each image is 700×460.

Implementation details
All of the experiments were conducted on a platform with an Intel Core i7-5820K CPU and 16G random access memory. The BreaKHis dataset has been randomly divided into a training set (70%, 56 patients) and a test set (30%, 26 patients). We guarantee that patients used to build the training set are not used for the test set. Similar to the protocol proposed by Spanhol et al. [3], the dataset was randomly arranged into five folds. The results presented in this work are the average of five trials.
As for MIB classification, we hope to realize the recognition of breast cancer histopathological images without considering the magnifications. The training set for MIB classification is composed of training sets of 40×, 100×, 200×, and 400×, the same for the test set.
All the images we used for GLCM features were without any preprocessing. Since different network structures require different sizes of input, to compare with different baseline models, here we resized the images to 224×224, 227×227, 299×299. Among them, the input size of VGG16, ResNet50, GoogLeNet, and DenseNet201 is 224×224, the input size of AlexNet, SqueezeNet is 227×227, and the input size of Inception-ResNet-V2 is 299×299. The baseline models are all well pre-trained on ImageNet and Nvidia GeForce GTX 1080Ti GPU was used for model training. The stochastic gradient descent (SGD) method was used to fine-tune the weights of the entire network for the seven models, the momentum factor is 0.9. The initial learning rate was set as 0.0001 to avoid distorting the initial pre-trained weights as they have been already well tuned. We trained our model for 6 epochs with the minimum batch size of 10 images. The cross-entropy was adopted as the loss function. Taking 40× in fold1 as an example, the accuracy and loss curves of DenseNet201 are given in Fig 3. The images used for deep feature extraction were also resized to 224×224 in order to reduce the calculation while making a fair comparison with the baseline models. For the SVM, we chose the RBF kernel. The best penalty factor c = 2 and kernel function parameter g = 1 were obtained by cross validation.

Evaluation metrics
We report the recognition accuracy at both the image-level and the patient-level. For the image-level, let N rec_I be the number of images correctly classified, N represents all the test samples, then the recognition accuracy of the image-level can be defined as For the patient-level, we followed the definition of [3]. Let N P be the images of patient P, S is the total number of patients, and N rec_P images of patient P were correctly classified, then the patient score can be defined as and define the recognition accuracy of the patient-level as To further assess the performance of the proposed framework, sensitivity (Se), precision (Pr) and F1_score metrics were used and the formulations of the metrics are described as Pr ¼ where true positive (TP) represents the number of malignant samples classified as malignant, true negative (TN) represents the number of benign samples classified as benign. Also, false positive (FP) represents the number of benign samples incorrectly classified as malignant while false negative (FN) represents the number of malignant samples misclassified as benign.

Results
In this part, we separately discussed the results of MSB classification and MIB classification for breast cancer histopathological images recognition.

Magnification specific binary classification
Firstly, through comparative analysis, we find that the features of 1×1 convolutional layer in the 4th, 6th, 14th, 19th, 22nd and 23rd blocks in the last dense block are more discriminative.
For the sake of description, we use the following naming method: Dense Block4_block4_1 means to extract the output of the 1×1 convolutional layer in the 4th block of the 4th dense block as features. In this paper, we extracted the features of the following convolutional layers: Dense Block4_block4_1, Dense Block4_block6_1, Dense Block4_block14_1, Dense Block4_-block19_1, Dense Block4_block22_1, Dense Block4_block23_1, which are short as block4, block6, block14, block19, block22, block23 in the following description. Table 2 shows the comparison of classification performance of convolutional layer features with pooling layer features and fully connected layer features. Comparing the classification performance of different layer features under four magnifications, it can be found that the performance of fully connected layer features is worse than that of pooling layer features and deep convolutional layer features, and the recognition accuracy of four magnifications are all less than 90%. The recognition accuracy of the features of the pooling layers at 400× is significantly lower than that of the convolutional layers. This is because the images at 400× contain more accurate lesion information, which is often local information. The pooling operation loses part of the spatial information of the images, which makes cancer detection more difficult. Compared with the fully connected layer and pooling layer features, the convolutional layer features retain the spatial and structural information of the images. It can be seen from Table 2 that the convolutional layer features perform well for the images under four magnifications. The recognition accuracy are all higher than 90% except for block4 and block6 at 400×, and the performance of block14, block19 and block23 is better. In addition, it is verified by experiments that with the deepening of the network, the features become more and more abstract, the classification performance showed a downward trend. So the convolutional layer features deeper than block23 are no longer considered here. Table 3 shows the classification performance of fused features of deep semantic features and GLCM features for MSB classification. Comparing Tables 2 and 3, it can be found that the classification performance based on the combination of deep semantic features and GLCM is significantly better than the classification performance of deep semantic features. The highest recognition accuracy at the image-level is 96.75%, 95.21%, 96.57%, and 93.15% for 40×, 100×, 200×, 400×, respectively. And the highest recognition accuracy at the patient-level is 96.33%, 95.26%, 96.09%, and 92.99%, respectively. Compared with pooling layer and fully connected layer features, the fused features of convolutional layer features and GLCM achieve higher accuracy, as shown in Fig 4. Although the recognition accuracy of Average pool_3 is better than that of block4 and block6, its recognition time is about 15-20 times that of convolutional layer features. Based on the above conclusion, the fused features of convolutional layer features and GLCM for breast cancer histopathological images recognition is discussed below. The Receiver Operating Characteristic (ROC) curves of classification performance of different feature combinations are shown in Fig 5. In order to investigate the classification performance of different deep semantic features and GLCM, we use t-distributed stochastic neighbor embedding (t-SNE) to visualize deep semantic features, GLCM features and the fused features, as shown in with only a small number of samples interlaced. In addition, comparing different magnifications, it can be found that the features of 200× has the best separability, so as to obtain the highest recognition accuracy.
To further illustrate the effectiveness of the proposed method, we compared the performance of seven pre-trained baseline models for breast cancer binary classification, as shown in Table 4. It can be seen from Table 4 that under the same training conditions, as a whole, GoogLeNet and VGG16 performs well under the four magnifications. VGG16 obtained the highest imagelevel recognition accuracy of 93.87% and the highest patient-level recognition accuracy of 93.15% at 200×. Compared with the performance of DenseNet201, ResNet50 performs better at 40×, 100× and 200×, SqueezeNet performs better at 100× and 200×, and Inception-ResNet-V2 performs better at 100×. AlexNet performs worse than other baseline models, and the performance of DenseNet201 is at an intermediate level. All models perform best at 200×, followed by 100×, indicating that the images with these two magnifications not only contain enough global information, but also contain rich local information, which are more suitable for automatic breast cancer recognition.
In this paper, what we want to achieve is the fusion of deep semantic features and GLCM features. At first, we need to ensure the depth of the network to extract effective deep semantic features. And then the amount of network parameters and the dimension of the extracted

PLOS ONE
features are considered. DenseNet201 has a deeper structure and fewer parameters than Alex-Net, VGG16, ResNet50 and Inception-ResNet-V2. For example, in VGG16, with the increase of layers, the dimension of the extracted convolution layer features continues to increase according to the characteristics of network structure, which far exceeds the dimension of GLCM, so that the role of GLCM in the fused features is ignored, resulting in much worse recognition results for fused features than a single VGG16. DenseNet201 not only makes full use of information from different layers, but also limits the dimension of features through 1×1 convolution operations. Deep semantic features and GLCM features give full play to their respective advantages, so as to achieve better recognition results. Therefore, we chose Dense-Net201 as the feature extractor.
The recognition accuracy of all baseline models is lower than the method proposed in this paper. The comparison results are shown in Fig 7.

Magnification independent binary classification
In this section, we discuss the results of MIB classification. Table 5 shows the results of MIB classification.
Regardless of the magnifications, the recognition accuracy of the method proposed in this paper is still acceptable, especially for block14+GLCM, the image-level recognition accuracy is 95.56%, and the patient-level recognition accuracy is 95.54%, followed by block23+GLCM, the image-level and the patient-level recognition accuracy are 95.23% and 95.10%, respectively. The ROC curves of the recognition performance of different fused features are shown in Fig 8. Table 6 shows that the performance of the baseline models for MIB classification. It can be seen from Table 6 that GoogLeNet performs best among the models, followed by ResNet50 and DenseNet201. The performance of the method proposed in this paper is significantly better than the baseline models. Comparing Tables 4 and 6, it can be found that the baseline models are not sensitive to the magnifications. The recognition accuracy does not fluctuate much whether the magnification is considered or not.   Table 7 is a comparison between the method in this paper and the state-of-the-art methods. Works [45,46] divided the dataset according to the protocol of [3], works [31,47] divided the dataset according to the patients, the author in [48] divided the dataset according to the images in the image-level classification, the authors in [39,40] divided the dataset according to the images, and works [49][50][51] did not mention whether to divide the dataset according to the patients or the images. It can be seen from Table 7 that the classification performance of the methods which dividing the dataset according to the images is significantly better than our method. The recognition accuracy of our methods is significantly higher than other methods that dividing the dataset according to the patients except for [46,50], but there is still room for improvement.

Discussion
From the experimental results, we can see that the method proposed in this paper is very effective in classifying the breast cancer for both the MSB classification and MIB classification. Compared with the pooling layer features and the fully connected layer features, the convolutional layer features retain more spatial and structural information of the images, and show better separability, which is beneficial to the recognition of breast cancer. In addition, we discussed the classification problem of breast cancer which does not depend on magnifications. It does not need to consider the magnification of the images in this method, and avoids the trouble of training multiple models with different magnifications, which ensures the high accuracy of recognition while improving the efficiency of the model training. It is meaningful in practical application. Given an unlabeled image, what we need to do is to identify whether it is benign or malignant, while without considering its magnification. A commonly used method in existing works is model training based on image patches. Firstly, the original images need to be divided into small image patches, the labels of the image patches are predicted one by one, and then the labels of the image patches are integrated to predict the image label. There is a problem with this method. For a malignant image, the malignant tissues are not full of the whole image, it often contains some benign tissues. Using the label of the original image as the label of the image patches cannot guarantee the consistency of the label and often reduces the recognition accuracy. It is very time-consuming for getting image patches, and the classification performance of the model also depends on the size of the image patches. There are also some researchers who use data augmentation to increase the diversity of the samples when training the model, but for pre-trained models, we only need to adjusted some of the parameters. Although we did not consider data augmentation, there is no over-fitting problem in our method.
In view of this, we used the original images for GLCM feature extraction, and used the resized images to fine-tune the pre-trained models. GLCM provides the texture features of the images, DenseNet201 makes full use of the features of different layers, and the features of the deep convolutional layer retain more spatial information of the images. These features are complementary to each other and achieve better recognition performance.

Conclusion
In this paper, a breast cancer histopathological images classification method based on the fusion of DenseNet201 deep semantic features and three-channel GLCM features is proposed. Unlike other methods that only consider the features of the pooling layers and the fully connected layers of the CNN models, we explored the discriminative ability of different deep convolutional layer features, and fused the extracted deep semantic features with three-channel GLCM features for breast cancer histopathological images MSB and MIB classification. For the four magnifications, the highest recognition accuracy of the image-level is 96.75%, 95.21%, 96.57%, 93.15%, respectively, and the highest recognition accuracy of the patient-level is 96.33%, 95.26%, 96.09%, 92.99%, respectively. The accuracy of the image-level and the patientlevel for MIB classification is 95.56% and 95.54%, respectively. Experimental results show that the method proposed in this paper is robust to the two classification problems. The comparison results with seven baseline models indicate that the performance of the method proposed in this paper is better.
In the future work, we will continue to study the multi-class recognition of breast cancer histopathological images and realize the sub-class recognition of breast cancer, which can provide more accurate theoretical basis for pathologists, and to further reduce their workload.